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RAW SEQUENCE LISTING DATE: 02/11/2002 

PATENT APPLICATION: US/09/939 , 537 TIME: 09:33:06 

Input Set : N:\Crf3\RULE60\09939537.txt 
Output Set: N:\CRF3\02112002\I939537.raw 

SEQUENCE LISTING 

C--> 5 (1) GENERAL INFORMATION: 

7 (i) APPLICANT: Seed, Brian 

8 Banapour / Babak 

9 Romeo, Charles 

10 Kolanus, Waldemar 

C--> 12 (ii) TITLE OF INVENTION: TARGETED CYTOLYSIS OF HIV-INFECTED 

13 CELLS BY CHIMERIC CD 4 RECEPTOR- BEARING CELLS 

15 (iii) NUMBER OF SEQUENCES: 53 

17 (iv) CORRESPONDENCE ADDRESS: 

18 (A) ADDRESSEE: Clark & Elbing LLP 

19 (B) STREET: 176 Federal Street 

20 (C) CITY: Boston 

21 (D) STATE: MA 

22 (E) COUNTRY: USA 

23 (F) ZIP: 02110 
2 5 (v) COMPUTER READABLE FORM: 
26 (A) MEDIUM TYPE: Diskette 
2 7 (B) COMPUTER: IBM Compatible 

28 (C) OPERATING SYSTEM: DOS 

29 (D) SOFTWARE: FastSEQ for Windows Version 2.0 
31 (vi) CURRENT APPLICATION DATA: 

C--> 32 (A) APPLICATION NUMBER: US/09/939,537 

C--> 33 (B) FILING DATE: 24-Aug-2001 

39 (C) CLASSIFICATION: 

C--> 41 (vii) PRIOR APPLICATION DATA: 

37 (A) APPLICATION NUMBER: 08/284,391 

38 (B) FILING DATE: 02-AUG-1994 

42 (A) APPLICATION NUMBER: 08/195,395 

43 (B) FILING DATE: 14-FEB-1994 

45 (A) APPLICATION NUMBER: 07/847,566 

46 (B) FILING DATE: 06-MAR-1992 

48 (A) APPLICATION NUMBER: 07/665,961 

49 (B) FILING DATE: 07-MAR-1991 

51 (viii) ATTORNEY/AGENT INFORMATION: 

52 (A) NAME: Elbing, Karen L 

53 (B) REGISTRATION NUMBER: 35,238 

54 (C) REFERENCE/DOCKET NUMBER: 00786/247001 

56 (ix) TELECOMMUNICATION INFORMATION: 

57 (A) TELEPHONE: 617-428-0200 

58 (B) TELEFAX: 617-428-7045 

59 (C) TELEX: 
62 (2) INFORMATION FOR SEQ ID NO : 1: 
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64 (i) SEQUENCE CHARACTERISTICS: 

65 (A) LENGTH: 1728 base pairs 

66 (B) TYPE: nucleic acid 

6 7 (C) STRANDEDNESS : double 

6 8 (D) TOPOLOGY: linear 

70 (ii) MOLECULE TYPE: cDNA 

72 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1: 



7 A 


a Tr a a Hence 


vjAVj X V—L-V— XXX 


XAVjVjL-A\_X X\J 


L-X IUIoVjIuL 


XVjV-AAC Xvj^jC 


r , r ,r prr ,r ?pppa 

VjL- X ttltLL-A 


D V 




c par r*His L r ,r pr t 


ALjovjAAAwAA 


AVj 1 X VjL- X \j 


ouUAAAAAAIj 


Vj^jVjA X AL-Atj X 


^(ji AA t X <c» ALL 


ion 




x\j X AL-A<jrL- X JL 




PAfT'ATAr'AA 


J. 1 LLAL. X VjLtA 


aaa arTrra a 

AAAAL. X tLAA 


pfAr 1 at* a aar 

LLAuA X AAAVj 


i fin 


7 7 




A -L L.A*jrvjrVj^_ 1 




AAAVjVj X L-^- A 1 


L-L-AAVjV— XkjAA 


X Vj A X LVjV—VjL X 


Z H U 


7 P 


uAL 1 L-AAVj AA 


uAAbLLl X Xkj 


\j \j A L* A A Vj vj A 


AAt X X Ltttt 


T 1 *" A IT 1 A •Pf"' A A 
X \j A X tA X tAA 


r* A A TTT^A A C 
oAA AAu 


•3Af\ 
jUU 


7 Q 


A 1 ALt AAQj AL- 1 


/"< A A rp a pmrn a 
LAbAlAtl XA 


/"•a m/Tp/Tp^* a A 
C A i C 1 J. (j AA 


r* rnr* r* a /""/"A O/™" 
XCjOACjCjACC 


A/^AA/^PA/^/^A 

ACj A ACj ACj A 


r* r* rp/" 1 A A rn rp/"> 
IbLAA X X (j 




Q A 

oU 


C 1 A(j X (j X X CCj 


Lr A 1 1 \j At lbt 


tAAL IL1 CjAv- 


a /"•/"■/"'A PPTrr" 


X XCAOkatatjL-A 


(j AvjCC- 1 \y ACQ, 


/ion 
4 z U 


81 


CTGACCTTGG 


AGAGCCCCCC 


TGGTAGTAGC 


CCCTCAGTGC 


AATGTAGGAG 


TCCAAGGGGT 


480 


82 


AAAAACATAC 


AGGGGGGGAA 


GACCCTCTCC 


GTGTCTCAGC 


TGGAGCTCCA 


GGATAGTGGC 


540 


83 


ACCTGGACAT 


GCACTGTCTT 


GCAGAACCAG 


AAGAAGGTGG 


AGTTCAAAAT 


AGACATCGTG 


600 


84 


GTGCTAGCTT 


TCCAGAAGGC 


CTCCAGCATA 


GTCTATAAGA 


AAGAGGGGGA 


ACAGGTGGAG 


660 


85 


TTCTCCTTCC 


CACTCGCCTT 


TACAGTTGAA 


AAGCTGACGG 


GCAGTGGCGA 


GCTGTGGTGG 


720 


86 


CAGGCGGAGA 


GGGCTTCCTC 


CTCCAAGTCT 


TGGATCACCT 


TTGACCTGAA 


GAACAAGGAA 


780 


87 


GTGTCTGTAA 


AACGGGTTAC 


CCAGGACCCT 


AAGCTCCAGA 


TGGGCAAGAA 


GCTCCCGCTC 


840 


88 


CACCTCACCC 


TGCCCCAGGC 


CTTGCCTCAG 


TATGCTGGCT 


CTGGAAACCT 


CACCCTGGCC 


900 


89 


CTTGAAGCGA 


AAACAGGAAA 


GTTGCATCAG 


GAAGTGAACC 


TGGTGGTGAT 


GAGAGCCACT 


960 


90 


CAGCTCCAGA 


AAAATTTGAC 


CTGTGAGGTG 


TGGGGACCCA 


CCTCCCCTAA 


GCTGATGCTG 


1020 


91 


AGCTTGAAAC 


TGGAGAACAA 


GGAGGCAAAG 


GTCTCGAAGC 


GGGAGAAGCC 


GGTGTGGGTG 


1080 


92 


CTGAACCCTG 


AGGCGGGGAT 


GTGGCAGTGT 


CTGCTGAGTG 


ACTCGGGACA 


GGTCCTGCTG 


1140 


93 


GAATCCAACA 


TCAAGGTTCT 


GCCCACATGG 


TCCACCCCGG 


TGCACGCGGA 


TCCCAAACTC 


1200 


94 


TGCTACTTGC 


TAGATGGAAT 


CCTCTTCATC 


TACGGAGTCA 


TCATCACAGC 


CCTGTACCTG 


1260 


95 


AGAGCAAAAT 


TCAGCAGGAG 


TGCAGAGACT 


GCTGCCAACC 


TGCAGGACCC 


CAACCAGCTC 


1320 


96 


TACAATGAGC 


TCAATCTAGG 


GCGAAGAGAG 


GAATATGACG 


TCTTGGAGAA 


GAAGCGGGCT 


1380 


97 


CGGGATCCAG 


AGATGGGAGG 


CAAACAGCAG 


AGGAGGAGGA 


ACCCCCAGGA 


AGGCGTATAC 


1440 


98 


AATGCACTGC 


AGAAAGACAA 


GATGCCAGAA 


GCCTACAGTG 


AGATCGGCAC 


AAAAGGCGAG 


1500 


99 


AGGCGGAGAG 


GCAAGGGGCA 


CGATGGCCTT 


TACCAGGACA 


GCCACTTCCA 


AGCAGTGCAG 


1560 



100 TTCGGGAACA GAAGAGAGAG AGAAGGTTCA GAACTCACAA GGACCCTTGG GTTAAGAGCC 1620 

101 CGCCCCAAAG GTGAAAGCAC CCAGCAGAGT AGCCAATCCT GTGCCAGCGT CTTCAGCATC 1680 

102 CCCACTCTGT GGAGTCCATG GCCACCCAGT AGCAGCTCCC AGCTCTAA 1728 
104 (2) INFORMATION FOR SEQ ID NO : 2 : 



106 (i) SEQUENCE CHARACTERISTICS: 

107 (A) LENGTH: 1389 base pairs 

108 (B) TYPE: nucleic acid 

109 (C) STRANDEDNESS: double 

110 (D) TOPOLOGY: linear 
112 (ii) MOLECULE TYPE: cDNA 

114 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



116 ATGAACCGGG GAGTCCCTTT TAGGCACTTG CTTCTGGTGC TGCAACTGGC GCTCCTCCCA 60 

117 GCAGCCACTC AGGGAAACAA AGTGGTGCTG GGCAAAAAAG GGGATACAGT GGAACTGACC 120 

118 TGTACAGCTT CCCAGAAGAA GAGCATACAA TTCCACTGGA AAAACTCCAA CCAGATAAAG 180 

119 ATTCTGGGAA ATCAGGGCTC CTTCTTAACT AAAGGTCCAT CCAAGCTGAA TGATCGCGCT 24 0 

120 GACTCAAGAA GAAGCCTTTG GGACCAAGGA AACTTCCCCC TGATCATCAA GAATCTTAAG 300 
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121 ATAGAAGACT CAGATACTTA CATCTGTGAA GTGGAGGACC AGAAGGAGGA GGTGCAATTG 

122 CTAGTGTTCG GATTGACTGC CAACTCTGAC ACCCACCTGC TTCAGGGGCA GAGCCTGACC 
12 3 CTGACCTTGG AGAGCCCCCC TGGTAGTAGC CCCTCAGTGC AATGTAGGAG TCCAAGGGGT 

124 AAAAACATAC AGGGGGGGAA GACCCTCTCC GTGTCTCAGC TGGAGCTCCA GGATAGTGGC 

125 ACCTGGACAT GCACTGTCTT GCAGAACCAG AAGAAGGTGG AGTTCAAAAT AGACATCGTG 

126 GTGCTAGCTT TCCAGAAGGC CTCCAGCATA GTCTATAAGA AAGAGGGGGA ACAGGTGGAG 

127 TTCTCCTTCC CACTCGCCTT TACAGTTGAA AAGCTGACGG GCAGTGGCGA GCTGTGGTGG 
12 8 CAGGCGGAGA GGGCTTCCTC CTCCAAGTCT TGGATCACCT TTGACCTGAA GAACAAGGAA 
12 9 GTGTCTGTAA AACGGGTTAC CCAGGACCCT AAGCTCCAGA TGGGCAAGAA GCTCCCGCTC 

130 CACCTCACCC TGCCCCAGGC CTTGCCTCAG TATGCTGGCT CTGGAAACCT CACCCTGGCC 

131 CTTGAAGCGA AAACAGGAAA GTTGCATCAG GAAGTGAACC TGGTGGTGAT GAGAGCCACT 

132 CAGCTCCAGA AAAATTTGAC CTGTGAGGTG TGGGGACCCA CCTCCCCTAA GCTGATGCTG 

133 AGCTTGAAAC TGGAGAACAA GGAGGCAAAG GTCTCGAAGC GGGAGAAGCC GGTGTGGGTG 

134 CTGAACCCTG AGGCGGGGAT GTGGCAGTGT CTGCTGAGTG ACTCGGGACA GGTCCTGCTG 

135 GAATCCAACA TCAAGGTTCT GCCCACATGG TCCACCCCGG TGCACGCGGA TCCGCAGCTC 

136 TGCTATATCC TGGATGCCAT CCTGTTTTTG TATGGTATTG TCCTTACCCT GCTCTACTGT 

137 CGACTCAAGA TCCAGGTCCG AAAGGCAGAC ATAGCCAGCC GTGAGAAATC AGATGCTGTC 

138 TACACGGGCC TGAACACCCG GAACCAGGAG ACATATGAGA CTCTGAAACA TGAGAAACCA 

139 CCCCAATAG 

141 (2) INFORMATION FOR SEQ ID NO : 3: 

143 (i) SEQUENCE CHARACTERISTICS: 

144 (A) LENGTH: 1599 base pairs 

145 (B) TYPE: nucleic acid 
14 6 (C) STRANDEDNESS : double 
147 (D) TOPOLOGY: linear 

14 9 (ii) MOLECULE TYPE: cDNA 

151 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3: 

153 ATGAACCGGG GAGTCCCTTT TAGGCACTTG CTTCTGGTGC TGCAACTGGC GCTCCTCCCA 

154 GCAGCCACTC AGGGAAACAA AGTGGTGCTG GGCAAAAAAG GGGATACAGT GGAACTGACC 

155 TGTACAGCTT CCCAGAAGAA GAGCATACAA TTCCACTGGA AAAACTCCAA CCAGATAAAG 

156 ATTCTGGGAA ATCAGGGCTC CTTCTTAACT AAAGGTCCAT CCAAGCTGAA TGATCGCGCT 

157 GACTCAAGAA GAAGCCTTTG GGACCAAGGA AACTTCCCCC TGATCATCAA GAATCTTAAG 

158 ATAGAAGACT CAGATACTTA CATCTGTGAA GTGGAGGACC AGAAGGAGGA GGTGCAATTG 

159 CTAGTGTTCG GATTGACTGC CAACTCTGAC ACCCACCTGC TTCAGGGGCA GAGCCTGACC 

160 CTGACCTTGG AGAGCCCCCC TGGTAGTAGC CCCTCAGTGC AATGTAGGAG TCCAAGGGGT 

161 AAAAACATAC AGGGGGGGAA GACCCTCTCC GTGTCTCAGC TGGAGCTCCA GGATAGTGGC 

162 ACCTGGACAT GCACTGTCTT GCAGAACCAG AAGAAGGTGG AGTTCAAAAT AGACATCGTG 
16 3 GTGCTAGCTT TCCAGAAGGC CTCCAGCATA GTCTATAAGA AAGAGGGGGA ACAGGTGGAG 

164 TTCTCCTTCC CACTCGCCTT TACAGTTGAA AAGCTGACGG GCAGTGGCGA GCTGTGGTGG 

165 CAGGCGGAGA GGGCTTCCTC CTCCAAGTCT TGGATCACCT TTGACCTGAA GAACAAGGAA 
16 6 GTGTCTGTAA AACGGGTTAC CCAGGACCCT AAGCTCCAGA TGGGCAAGAA GCTCCCGCTC 

167 CACCTCACCC TGCCCCAGGC CTTGCCTCAG TATGCTGGCT CTGGAAACCT CACCCTGGCC 

168 CTTGAAGCGA AAACAGGAAA GTTGCATCAG GAAGTGAACC TGGTGGTGAT GAGAGCCACT 

169 CAGCTCCAGA AAAATTTGAC CTGTGAGGTG TGGGGACCCA CCTCCCCTAA GCTGATGCTG 

170 AGCTTGAAAC TGGAGAACAA GGAGGCAAAG GTCTCGAAGC GGGAGAAGCC GGTGTGGGTG 

171 CTGAACCCTG AGGCGGGGAT GTGGCAGTGT CTGCTGAGTG ACTCGGGACA GGTCCTGCTG 

172 GAATCCAACA TCAAGGTTCT GCCCACATGG TCCACCCCGG TGCACGCGGA TCCCAAACTC 

173 TGCTACCTGC TGGATGGAAT CCTCTTCATC TATGGTGTCA TTCTCACTGC CTTGTTCCTG 

174 AGAGTGAAGT TCAGCAGGAG CGCAGAGCCC CCCGCGTACC AGCAGGGCCA GAACCAGCTC 



360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1389 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
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17 5 TATAACGAGC TCAATCTAGG ACGAAGAGAG GAGTACGATG TTTTGGACAA GAGACGTGGC 1380 

176 CGGGACCCTG AGATGGGGGG AAAGCCGAGA AGGAAGAACC CTCAGGAAGG CCTGTACAAT 14 4 0 

177 GAACTGCAGA AAGATAAGAT GGCGGAGGCC TACAGTGAGA TTGGGATGAA AGGCGAGCGC 1500 

178 CGGAGGGGCA AGGGGCACGA TGGCCTTTAC CAGGGTCTCA GTACAGCCAC CAAGGACACC 1560 

179 TACGACGCCC TTCACATGCA GGCCCTGCCC CCTCGCTAA 1599 
181 (2) INFORMATION FOR SEQ ID NO : 4 : 

183 (i) SEQUENCE CHARACTERISTICS: 

184 (A) LENGTH: 575 amino acids 

185 (B) TYPE: amino acid 

186 (C) STRANDEDNESS: single 

187 (D) TOPOLOGY: linear 
189 (ii) MOLECULE TYPE: protein 

191 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

193 Met Asn Arg Gly Val Pro Phe Arg His Leu Leu Leu Val Leu Gin Leu 

194 15 10 15 

195 Ala Leu Leu Pro Ala Ala Thr Gin Gly Asn Lys Val Val Leu Gly Lys 

196 20 25 30 

197 Lys Gly Asp Thr Val Glu Leu Thr Cys Thr Ala Ser Gin Lys Lys Ser 

198 35 40 45 

199 lie Gin Phe His Trp Lys Asn Ser Asn Gin lie Lys lie Leu Gly Asn 

200 50 55 60 

201 Gin Gly Ser Phe Leu Thr Lys Gly Pro Ser Lys Leu Asn Asp Arg Ala 

202 65 70 75 80 

203 Asp Ser Arg Arg Ser Leu Trp Asp Gin Gly Asn Phe Pro Leu lie lie 

204 85 90 95 

205 Lys Asn Leu Lys lie Glu Asp Ser Asp Thr Tyr lie Cys Glu Val Glu 

206 100 105 110 

207 Asp Gin Lys Glu Glu Val Gin Leu Leu Val Phe Gly Leu Thr Ala Asn 

208 115 120 125 

209 Ser Asp Thr His Leu Leu Gin Gly Gin Ser Leu Thr Leu Thr Leu Glu 

210 130 135 140 

211 Ser Pro Pro Gly Ser Ser Pro Ser Val Gin Cys Arg Ser Pro Arg Gly 

212 145 150 155 160 

213 Lys Asn lie Gin Gly Gly Lys Thr Leu Ser Val Ser Gin Leu Glu Leu 

214 165 170 175 

215 Gin Asp Ser Gly Thr Trp Thr Cys Thr Val Leu Gin Asn Gin Lys Lys 

216 180 185 190 

217 Val Glu Phe Lys He Asp He Val Val Leu Ala Phe Gin Lys Ala Ser 

218 195 200 205 

219 Ser He Val Tyr Lys Lys Glu Gly Glu Gin Val Glu Phe Ser Phe Pro 

220 210 215 220 

221 Leu Ala Phe Thr Val Glu Lys Leu Thr Gly Ser Gly Glu Leu Trp Trp 

222 225 230 235 240 
22 3 Gin Ala Glu Arg Ala Ser Ser Ser Lys Ser Trp He Thr Phe Asp Leu 

224 245 250 255 

225 Lys Asn Lys Glu Val Ser Val Lys Arg Val Thr Gin Asp Pro Lys Leu 

226 260 265 270 

227 Gin Met Gly Lys Lys Leu Pro Leu His Leu Thr Leu Pro Gin Ala Leu 

228 275 280 285 
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229 


Pro 


Gin 


Tvr 


Ala 


Glv 

vjxy 


Ser 


Glv 


As n 


Leu 


Thr 


Leu 


Ala 


Leu 


Glu 


Ala 


Lys 


230 




290 










295 










300 










231 


Thr 


Glv 
oxy 


Lys 


Leu 


His 


Gin 


Glu 


Val 


Asn 


Leu 


Val 


Val 


Met 


Arg 


Ala 


Thr 


232 


305 










310 










315 










320 




Cl 1 n 

X 11 


Leu 


Cl 1 n 

Vjj _L 11 


Lys 


as n 


Leu 


X 11X 


\_ y & 


OXU 


val 


irp 


ox y 


it X LJ 


Thr 


Ser 


IrX LJ 


2 34 










325 










330 










335 




9 ^ R 

£t J J 




Leu 


M<=»t 

1 '1C u 


XjC li 


OCX 


XjC- Li 


T XT c 


XjC Li 


r* i ii 

VjX Li 


A c n 
noil 


Xj jr o 


\J X Li 


Alp 


Lys 


Val 


Car 

OCX 


2 36 








340 










34 5 










350 






z j / 


Lys 


A r /-T 


Cl 1 n 


T.vrc 
xjy o 


IT X \J 


vax 


irp 


vax 


T.oi i 

XjC Li 


7\ en 

noil 


lr X \J 


VjX LI 


Al ^ 
rVX a 


Gly Met 


T 1 r "pi 

irp 


238 






355 










360 










365 








£i O Zs 


Gin 


Cys 


Leu 


Leu 


Ser 


Asp 


Ser 


Gly 


Gin 


Val 


Leu 


Leu 


Glu 


Ser 


Asn 


lie 


240 




370 










375 










380 










941 


Ly s 


vdl 


T.oi i 
J-it; U. 


Dr A 


X 11X 


irp 


Cot 
OCX 


X 1 IX 


TD -y~ /^V 
IT X W 


Va 1 

V Cl X 




Ala 


nop 


Pro 


Lys 


T.^i i 

XjC Li 


242 


385 










390 










395 










400 


94 ^ 

*i J 


_y o 


r P\7T~ 

xyr 


XjC; Li. 


JJC Li 


Act* 


x y 


Tl & 

X X c 


-LjC Li 


IT 11C 


Tip 
X X ~ 


Tvr 

iyr 




Val 


Tl o 
X X c 


Tip 

X X c 


Thr 


244 










405 










410 










4 1 S 




94 S 

Z. *t -J 


nXCl 


uc li 


xyx 


Leu 


A rrr 


Ala 


Lys 


Phe 


Ser 


A rrr 


cpr 

OCX 


Ala 


Glu 


Thr 


Ala 


Ala 


246 








420 










425 










4 ? n 






94 7 


Aon 


T.^i i 
XjC u 




A cn 


i X \J 


A c n 

noil 


VJ X 11 


J_l" Li 


xyr 


Sen 
no 11 


vXU 


XjC Li 


Asn 


Leu 


uiy 


A TCT 

/\x y 


248 






435 










440 










445 








94 Q 


A r cj 
r\. X y 


UlU 


(Z 1 ii 




Tien 


vax 


XjC Li 


Cl "1 n 


T.VO 
Xjjt £> 




rvx y 


Al a 
nXd 


Arg 


ASp 


ri u 


Glu 


9 




4 










4 S S 

t -j j 










4 60 










9^1 


1 "1C L. 






T.vC 


Cl I n 

Lj X ii 




A TCT 

rix y 


/■vx y 


A r fr 
r\X y 


Asn 


Pro 


Gin 


Glu 




V a. X 


Tvr 
xyx 


9 R9 


4 fi S 










4 70 










4 7 5 










4 80 


9 R ^ 

Z J J 


As n 


Alp 


Leu 


Vj X 11 


Lys 


no L) 


Lys 


11 C L. 


IT X KJ 


vjt X Li 


Ala 
rVX d 


xyr 


ocx 




Tip 
X X c 


Gly 


2 54 










485 










490 










45 J 




2 55 


Thr 


Lys 


Gly 


Glu 


Arg 


Arg 


Arg 


Gly 


Lys 


Gly 


His 


Asp 


Gly 


Leu 


Tvr 

xyx 


Gin 


9 Rfi 

Z» J D 








500 










505 










n 






9 S7 

Z. J / 


Asp 


Ser 


His 


Phe 


Gin 


Ala 


Val 


Gin 


Phe 


Gly 


A a n 

t\ O 11 


Arg 


A TTCT 

t\JL y 


Glu 


A TO 


Cl "1 ii 


9 Rft 






515 










520 










JZ J 








9 

Z. J -? 




Ser 


Glu 


Leu 


Thr 


Arg 


Thr 


Leu 


Gly 


Leu 


Arg 


al a 
r\X a. 


A rn 

arg 


Pro 


Lys 


Cl 1 v 






530 










535 










S4 0 

*r VJ 










ZOl 


V3lU 


Ser 


Thr 


Gin 


Gin 


Ser 


Ser 


Gin 


Ser 


Cys 


Ala 


Ser 


Val 


Phe 


Ser 


lie 


9fi 9 

Z> D Z. 


^4 S 










550 










-j j 










J D \J 


96 ^ 
z. O J 


I: X U 


Thr 


Leu 


Trp 


Ser 


Pro 


Trp 


Pro 


Pro 


Ser 


Cpr 

OCX 


Cpr 
OCX 


Cpr 

OCX , 


Gin 


Leu 




264 










565 










570 










575 




9fi fi 


( Z ) 


INFORMATION FOR SEQ : 


ID NO: 5 


















268 




(i) 


SEQUENCE CHARACTERISTICS: 
















269 






(A) 


LENGTH 


: 462 amino acids 














270 






(B) 


TYPE: amino acid 


















271 






(C) 


STRANDEDNESS : single 
















272 






<D) 


TOPOLOGY: linear 


















274 




(ii) 


MOLECULE TYPE: protein 


















276 




(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO 


: 5: 












278 


Met 


Asn 


Arg 


Gly Val 


Pro 


Phe 


Arg 


His 


Leu 


Leu 


Leu 


Val 


Leu 


Gin 


Leu 


279 


1 








5 










10 










15 




280 


Ala 


Leu 


Leu 


Pro 


Ala 


Ala 


Thr 


Gin 


Gly 


Asn 


Lys 


Val 


Val 


Leu 


Gly 


Lys 


281 








20 










25 










30 






282 


Lys 


Gly 


Asp 


Thr 


Val 


Glu 


Leu 


Thr 


Cys 


Thr 


Ala 


Ser 


Gin 


Lys 


Lys 


Ser 
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L:12 M:220 C: Keyword misspelled or invalid format, [(ii) TITLE OF INVENTION:] 
L:32 M:220 C: Keyword misspelled or invalid format, [(A) APPLICATION NUMBER : ] 
L:33 M:220 C: Keyword misspelled or invalid format, [(B) FILING DATE:] 
L:36 M:220 C: Keyword misspelled or invalid format, [(vii) PRIOR APPLICATION DATA:] 
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